Chan et al. Gut Pathogens 2013, 5:29 
http://www.gutpathogens.eom/content/5/1/29 



Sg3 Gut Pathogens 



GENOME ANNOUNCEMENT Open Access 



Insights of biosurfactant producing Serratia 
marcescens strain W2.3 isolated from diseased 
tilapia fish: a draft genome analysis 

Xin Yue Chan 1 , Chien Yi Chang 2 ' 3 , Kar Wai Hong 1 , Kok Keng Tee 4 , Wai Fong Yin 1 and Kok Gan Chan 1 ' 2 ' 



Abstract 

Background: Serratia marcescens is an opportunistic bacterial pathogen with broad range of host ranging from 
vertebrates, invertebrates and plants. 5. marcescens strain W2.3 was isolated from a diseased tilapia fish and it was 
suspected to be the causal agent for the fish disease as virulence genes were found within its genome. In this 
study, for the first time, the genome sequences of S. marcescens strain W2.3 were sequenced using the lllumina 
MiSeq platform. 

Result: Several virulent factors of 5. marcescens such as serrawettin, a biosurfactant, has been reported to be 
regulated by A/-acyl homoserine lactone (AHL)-based quorum sensing (QS). In our previous studies, an unusual AHL 
with long acyl side chain was detected from this isolate suggesting the possibility of novel virulence factors 
regulation. This evokes our interest in the genome of this bacterial strain and hereby we present the draft genome 
of S. marcescens W2.3, which carries the serrawettin production gene, swrA and the AHL-based QS transcriptional 
regulator gene, luxR which is an orphan luxR. 

Conclusion: With the availability of the whole genome sequences of S. marcescens W2.3, this will pave the way for 
the study of the QS-mediated genes expression in this bacterium. 
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Background 

Serratia marcescens is common microorganism presence 
in soil and freshwater [1]. However, the emergence of 
multidrug resistant Serratia has been alarming not only 
in the medical field but also aquaculture and agriculture 
sectors [2-4]. In 2009 an endemic disease outbreak in 
fish farms of Malaysia had killed more than 50% of the 
tilapia fish. Five bacteria strains including S. marcescens 
W2.3 have been isolated from the fish samples and been 
suspected to be the causal agent of the outbreak. 

Quorum sensing (QS) describes bacteria community 
gene regulation by cell-cell communication through the 
production of QS signalling molecule [5]. S. marcescens 
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which is taxonomically classified as Proteo bacteria pro- 
duces A/- acyl homoserine lactone (AHL) as QS signal mol- 
ecules. Its AHL-based QS system plays a regulatory role in 
biosurfactant production, biofilm formation, motility, 
prodigiosin and nuclease production that contribute to 
the pathogenesis [1,6]. Unlike most of S. marcescens, 
S. marcescens W2.3 produce Af-dodecanoyl-homoserine 
lactone (C12-HSL) rather than short chain AHLs. There- 
fore, this suggests the presence of novel AHLs responding 
proteins and virulence factors that may be regulated under 
this long chain AHL-based QS in S. marcescens W2.3. 

The rapid maturation of next generation sequencing 
(NGS) technology coupling with the fast improvement 
of computing power enable researcher to map bacteria 
genome within short period of time. Annotating the 
draft genome with the aid of databases available allow us 
to look into several genes simultaneously. Here, we 
present insights of the S. marcescens W2.3 genome, 
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describing the presence of putative QS related genes and 
the serrawettin coding gene, swrA. 

Methods 

Bacterial culture 

S. marcescens W2.3 is routinely maintained on either LB 
(Luria- Bertani, BD, USA) agar plates at 37°C or culture 
for 20 hrs in broth at 28°C with 200 r.p.m shaking. 

Genomic DNA extraction 

Genomic DNA of the bacteria was extracted with 
QIAamp DNA Mini kit (Qiagen, USA) and was 
subjected to RNase (Qiagen, USA) treatment. The DNA 
was eluted with elution buffer and subjected to DNA 
quantification with Qubif 2.0 Fluorometer (dsDNA 
High Sensitivity Assay Kit) (Invitrogen, USA), and 
qualification with Nanodrop Spectrophotometer and 
agarose gel electrophoresis. The genomic DNA was 
stored in -20°C. 

Library preparation and sequencing 

DNA sequencing template was prepared with Nextera™ 
DNA Sample Preparation kit (Nextera, USA). The qual- 
ity of DNA library was validated by Bioanalyzer 2100 
high sensitivity DNA kit prior to sequencing. Upon se- 
quencing, DNA (6 pM) was loaded into the sequencing 
cartridge and the sequencing was performed on Illumina 
MiSeq platform. 

Read quality assessment 

The quality of raw sequences as well as (G + C) content 
was checked with FastQC. Raw reads were trimmed at 
Phred 30 and were de novo assembled using CLC Gen- 
omic Workbench 5.1 [7]. Trimmed sequences were as- 
sembled with length fraction of 0.8 and similarity 
fraction of 0.8. Contigs with at least 30-fold coverage 
were subjected to gene prediction using Prodigal 2.6 [8]. 

Gene annotation was performed using RAST (Rapid 
Annotation using Subsystem Technology) followed by 
visualization of the bacterial genome using GeneWiz 
Browser 0.94 Server [9,10]. In addition to RAST, 
Serrawettin genes were annotated by BLAST against 
NCBInt/nr database with e-value of 0.0001 and aligned 
with reference genes using LAST [9,11]. Phylogenetic 
analysis was performed using MEGA version 5.0 [12]. 

Quality assurance 

The 16S rDNA gene from draft genome was used to 
check for contamination. RNAmmer 1.2 Server has 
shown that only a copy of 16S rDNA gene is presence in 
the draft genome. The contig that carried 16S rDNA 
gene was annotated by BLAST against NCBI microbial 
16S database and confirmed this 16S rDNA gene be- 
longs to Serratia marcescens [13,14]. 



Initial findings 

This sequencing generated 2,724,434 paired end reads 
whilst 2,509,440 quality reads were preceded to assembly 
after trimming. The genome size of Serratia marcescens 
W2.3 is 5.3 Mbp. The draft genome of S. marcescens 
W2.3 is made up of 72 contigs with the length of at least 
500 bps with the average coverage of 60-folds (Figure 1). 
N50 of this assembled genome turns out to be 216,941 bp 
while the (G + C) content of the draft genome is 59.3%. 
Prodigal shows that this draft genome carried 4891 coding 
DNA sequence (CDS). 

Draft genome was annotated using RAST and the re- 
sult was shown in Figure 2. Similar with most other bac- 
teria, most of the genes are responsible for carbohydrates 
and amino acid metabolism with 598 counts. This top hit 
follows by amino acids and derivatives, cofactors, vitamin, 
prosthetic groups and pigment production, RNA metabol- 
ism and cell wall and capsule synthesis. These genes are 
responsible for the basic needs in sustaining the life of a 
bacteria cell. In addition to the necessary genes, there are 
105 genes responsible for virulence, disease and defense 
suggesting S. marcescens W2.3 is a pathogen. 

Serratia produces serrawettin that acts as wetting 
agent to reduce surface tension of the environment 
[15,16]. The reduction of host cell surface tension by 
serrawettin causes the rupture of host cell leading the 
success of infection by this pathogen [17]. Serrawettin 
gene, swrA was found within the genome of S. marcescens 
W2.3. This 2631 bp gene was located at contig number 6. 
Dotplot (Figure 3) from LAST and BLAST result show 
that its serrawettin gene has 94% similarity with the 
serrawettin gene S. liquefaciens (Figure 3A) and 93% 
similarity with the serrawettin gene from S. marcescens 
A88copal3 (Figure 3B). 

In a complete AHL-based QS system, the luxI/R ho- 
mologs interact with each other where Luxl type protein 
synthesis AHL and binds to the LuxR-type protein [18]. 
Subsequently, this AHL-protein complex regulates the 
expression of certain genes leading to the group behav- 
ior of the bacteria [19]. However, luxl and luxR gene 
do not always occurs in paired in Proteobacteria. For 
example Pseudomonas aeruginosa and Sinorhizobium 
meliloti have been reported to carry unpaired luxR 
gene in their genome [18,20]. These unpaired receptor 
protein coding genes does not responsible for any sig- 
nalling molecule production but they are responsive to 
the cognate signalling molecules produced by both its 
existing AHL synthase and the signalling molecules 
from the environment [21], 

A putative luxR gene with the size of 759 bps was 
identified within contig 7 of S. marcescens W2.3 draft 
genome. Phylogenetics analysis of the LuxR protein se- 
quence of S. marcescens strain W2.3 and its closely re- 
lated species shows that it is highly similar to the LuxR 
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Figure 1 Atlas of Serratia marcescens W2.3. The atlas was constructed by GeneWiz Browser 0.94 and the GC skew was shown in the fifth lane 
counting from the outer most lane. The genome of Serratia marcescens W2.3 was 5.2 Mbps. 



Subsystem Coverage 



Subsystem Category Distribution 



Subsystem Feature Counts 

Cofactors, Vitamins, Prosthetic Groups, Pigments (310) 
Cell Wall and Capsule (219) 
Virulence, Disease and Defense (105) 
Potassium metabolism (35) 
Photosynthesis (0) 
Miscellaneous (47) 

Phages, Prophages, Transposable elements, Plasmids (44) 
Membrane Transport (182) 
Iron acquisition and metabolism (66) 
RNA Metabolism (247) 
Nucleosides and Nucleotides (133) 
Protein Metabolism (243) 
Cell Division and Cell Cycle (37) 
Motility and Chemotaxis (85) 
Regulation and Cell signaling (157) 
Secondary Metabolism (8) 
DNA Metabolism (160) 
Regulons (9) 

Fatty Acids, Lipids, and Isoprenoids (152) 
Nitrogen Metabolism (49) 
Dormancy and Sporulation (3) 
Respiration (147) 
Stress Response (172) 
Metabolism of Aromatic Compounds (60) 
Amino Acids and Derivatives (525) 
Sulfur Metabolism (60) 
Phosphorus Metabolism (68) 
Carbohydrates (598) 

Figure 2 Subsystem category distribution statistics for Serratia marcescens W2.3. The pie chart present the abundance of each subsystem 
category and the count of each subsystem feature was listed in parentheses at the chart legend. 
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Figure 3 Dotplot of Serratia marcescens W2.3 serrawettin, swrA gene DNA sequence against reference genes. (A) 5. morcescens W2.3 
serrawettin gene is 94% similar to the swrA of Serratio liquefaciens (gb|AF039572) (B) and 93% similar to the swrA of Serratia marcescens A88copa 
13 (gb|JX667980). 



protein sequence of S. plymuthica (SptR) (Figure 4). 
However, no luxl gene was identified within the upstream 
and downstream of this IwcR, thus we hypothesised that 
this is an orphan luxR. In 1998, Cox et al reported a solo 
luxR of S. marcescens, carR that regulates its carbapenem 
production [22]. However, the unpaired luxR gene of 
S. marcescens W2.3 is not group into the same tran- 
scriptional protein family as carR gene; therefore more 
studies need be conducted in the future for further un- 
derstanding of the interaction of unusual long AHL 



with luxR coding in the genome of S. marcescens W2.3 
isolated from the diseased tilapia fish and its rule in 
the pathogenesis. 

Future directions 

The whole genome of S. marcescens W2.3 has shown the 
presence of virulence factor coding genes and we have 
found the complete sequence of the serrawettin synthase 
gene. Since serrawettin production of S. marcescens is 
reported to be coordinated by QS, our future work will be 
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100 I-ACE75883.1 Serratia plymuthica LuxR 
I ACR22888.1 Serratia plymuthica SpsR 
AA038761.1 Serratia liquefaciens SwrR 



£1 



- CAB92554.1 Serratia sp. ATCC 39006 SmaR 

YP 001005359 1 Yersinia enterocolitica subsp. enterocolitica 8081 LuxR 

Serratia marcescens W2.3 LuxR 



AEL33286.1 Serratia plymuthica SptR 
— ACR22887.1 Serratia plymuthica SpIR 

YP 001478064.1 Serratia proteamaculans 568 

YP 003538485.1 Erwinia amylovora ATCC 49946 ExpR 

- AAC44036.1 Pseudomonas aeruginosa PA01 RhIR 

YP 001078154.1 Burkholderia mallei NCTC 10247 BpsR 

- NP 250121.1 Pseudomonas aeruginosa PA01 LasR 

NP 903760.1 Chromobacterium violaceum ATCC 12472 CviR 

YP 002550952 1 Agrobacterium vitis S4 LuxR 



— ADM80305 1 Aeromonas sobria LuxR 

- YP 004394647.1 Aeromonas veronii B565 LuxR 

YP 007974337.1 Sinorhizobium sp. M14 LuxR 

YP 473056.1 Rhizobium etli CFN RaiR 



- YP 001076161.1 Burkholderia pseudomallei 1106a BmpR 
AAN86705.2 Vibrio harveyi LuxR 



Figure 4 Phylogenetic analyses of S. marcescens luxR gene. The tree was constructed based on the luxR protein sequences by Neighbor- 
Joining with bootstraps value of 1000 replicates. 
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focusing on the study of the AHLbased QS gene regula- 
tion of S. marcescens W2.3. 

Availability of supporting data 

This whole genome shotgun project has been deposited at 
DDBJ/EMBL/GenBank under the accession ALOV00000000. 
The version described in this paper is the first version 
ALOV01000000. 
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